(57) ABSTRACT

For broadcasting purposes a multi-channel audio encoder 
board has been designed. A requirement for such encoders is 
that they are able to operate with different encoding param- 
eters. It may happen that encoding parameters change during 
encoding operation. In order to avoid the output of invalid 
data the encoding parameters required for a specific pro- 
cessing path are added to the input streams for the audio 
channels and become linked with the associated audio data 
and are stored in various buffers together with its audio data, 
i.e. the corresponding encoding parameters are kept linked 
with the audio data to be encoded throughout the encoding 
processing. 
METHOD AND APPARATUS FOR ENCODING OR DECODING AUDIO OR VIDEO
FRAME DATA

FIELD OF THE INVENTION

The invention relates to a method and to an apparatus for
encoding or decoding audio or video frame data.

BACKGROUND OF THE INVENTION

For broadcasting purposes a 4-stereo-channel MPEG audio
encoder board has been designed. A requirement for such
encoders is that they are able to operate with different
encoding parameters. MPEG allows e.g. various sample
frequencies and overall data rates.

SUMMARY OF THE INVENTION

A problem arises when during normal encoding operation one or
more parameters change. This may happen when the current type
of program changes, e.g. from pure speech or news to music.

Normally the audio frames are processed in an encoder in
subsequent different stages, for example conversion to
frequency coefficients in a first stage and bit allocation and
quantisation in a further stage. In a path in parallel to the
first stage the psychoacoustic masking is calculated. A video
encoder includes the following stages: block difference stage,
DCT (discrete cosine transform), quantisation and in the
feedback loop inverse quantisation, inverse DCT, motion
compensated interpolation the output of which is input to the
block difference stage, wherein the output of the quantisation
is possibly VLC (variable length coding) encoded and buffered
before final output and the buffer filling level is used to
control the quantisation in such a way that encoding artefacts
are masked as far as possible. If the encoding parameters
would generally change at a time instant where a certain audio
frame has been processed in such first stage but not yet in
such further stage, the data of this frame will become useless
after having been processed in the further stage with the
changed encoding parameters.

In order to avoid such problem a big table with the old
encoding parameters and a big table with the new encoding
parameters could be stored in the encoder for some time for
each channel wherein the 'depth' of the tables would depend on
the number of streams within the encoder and which would
require repeated updating. All processing stages of the
encoder would need to have access to both channel tables and
would need to determine at which time to access which of the
tables. In particular in a multichannel encoder in which
different channels may change different encoding parameters at
different times, the channels possibly being assigned to
different microprocessors, this solution could easily produce
errors. The tables would require more memory capacity than the
solution described below.

It is one object of the invention to disclose a method for
encoding or decoding audio or video frame data for which
encoding or decoding parameters are required. It is a further
object of the invention to disclose an apparatus which
utilises the inventive method.

In the invention intermediately stored general parameter
tables are not used. Instead, the encoding parameters required
for a specific processing path are added to the input streams
for the audio channels and become linked with the associated
audio data and are stored in the various buffers together with
its audio data, i.e. the corresponding encoding parameters are
kept linked with the audio data to be encoded throughout the
encoding processing in the different data streams and data
paths. Preferably the original encoding parameters assigned to
the processing paths become converted to a different format in
order to minimise the required word length and/or to
facilitate easy evaluation in the related processing stages.

Thereby each data stream can be processed with the correct
parameter set without waiting for finishing encoding of the
old data stream and for reset and loading of new parameters
before starting encoding of a new data stream with new
parameters.

The invention can also be used in audio or video decoders with
a corresponding inverse order of processing stages.

In principle, the inventive method is suited for encoding
audio or video frame data for which encoding parameters are
required, wherein the required encoding parameters become
linked at the input of the processing with frames of said
audio or video data to be encoded and throughout different
stages in the encoding processing, and wherein in each of
these stages the corresponding encoding parameters linked with
current frame data to be processed are regarded in order to
allow switching of the encoding parameters for any frame
thereby avoiding encoding of invalid output data without
reset, or:

for decoding audio or video frame data for which decoding
parameters are required, wherein the required decoding
parameters become linked at the input of the processing with
frames of said audio or video data to be decoded and
throughout different stages in the decoding processing, and
wherein in each of these stages the corresponding decoding
parameters linked with current frame data to be processed are
regarded in order to allow switching of the decoding
parameters for any frame thereby avoiding decoding of invalid
output data without reset.

In principle the inventive apparatus is suited for encoding
audio or video frame data for which encoding parameters are
required and includes:

means for linking the required encoding parameters with
frames of said audio or video data, said linking means being
arranged near the input of the apparatus;

means for converting time domain samples into frequency
domain coefficients, to the input of which means buffer
means are assigned;

means for calculating masking properties from said time
domain samples, to the input of which means buffer means are
assigned;

means for performing bit allocation and quantisation of the
coefficients under the control of the output of said masking
calculating means, to the input of which bit allocation and
quantisation means buffer means are assigned,

wherein in said conversion means, in said masking calculating
means and in said bit allocation and quantisation means the
corresponding encoding parameters linked with current frame
data to be processed are regarded in order to allow switching
of the encoding parameters for any frame thereby avoiding
encoding of invalid output data without reset.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the invention are described with reference to
the accompanying drawings, which show in:

FIG. 1 functional block diagram of a 4-channel audio encoder;
FIG. 2 linked data field including audio data to be encoded
and associated encoding parameters;

FIG. 3 microprocessor with memory including linked data
fields.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

The audio encoder (100) in FIG. 1 receives four stereo PCM
input signals PCMA (2A), PCMB (2B), PCMC (2C) and PCMD (2D).
E.g. MPEG audio data are frame based, each frame containing
1152 mono or stereo samples. The encoder operating system of
FIG. 1 may include six DSPs (Digital Signal Processor, not
depicted) for the encoding of the four MPEG channels. These
DSPs form a software encoder which includes the technical
functions depicted in FIG. 1. A suitable type of DSP is for
example ADSP 21060 or 21061 or 21062 of Analog Devices. As an
alternative, the technical functions depicted in FIG. 1 can be
realised in hardware.

Synchronisation of the software running on the six DSPs, or on
corresponding hardware, is achieved using FIFO buffers wherein
each buffer is assigned to one or some specific frames. This
means that at a certain time instant a current frame as well
as previous frames, the number of which depends on the
quantity of available buffers, are present in the processing
stages.

A global parameter switching would cause assignment of the new
parameters to also such buffers which still contain data to be
processed by the previous set of parameters. This would make
the content of such buffers useless. In the invention,
however, various encoding parameters like coding mode (mono,
stereo, dual, joint stereo), sample rate and data rate can be
changed on the fly without reset and without producing invalid
encoder output data.

Between some of the stages asynchronous buffers ASBUF are
inserted which allow asynchronous write and read operations.
Between other stages synchronous buffers BUF are sufficient.
The PCM input signals PCMA (2A), PCMB (2B), PCMC (2C) and PCMD
(2D) each pass via an asynchronous buffer to a respective
converter CONA (12A), CONB (12B), CONC (12C) and COND (12D).
In such converter an integer-to-floating representation
conversion of the audio samples to be encoded may take place.
It is also possible that the encoder processes integer
representation audio samples. In such converter also one or
more kinds of energy levels in a frame may be calculated, e.g.
energy of all samples of the frame or average energy of the
samples of a frame. These energy values may be used in the
subsequent psychoacoustic processing.

In addition, in such converter the possibly adapted encoding
parameters become linked with the frame audio data. In
respective parameter encoders PENCA (22A), PENCB (22B), PENCC
(22C) and PENCD (22D) the original encoding parameters may be
converted as described above and then fed to CONA (12A), CONB
(12B), CONC (12C) and COND (12D), respectively. In an MPEG
decoder the decoding parameters in the transmitted datastream
may be adapted correspondingly according to the hardware or
software requirements in the decoder before being (re-)linked
to each data frame.

Via asynchronous buffers the output data of CONA (12A), CONB
(12B), CONC (12C) and COND (12D) are fed in parallel to
sub-band filters SUBA (32A), SUBB (32B), SUBC (32C) and SUBD
(32D) and to first left and right channel psychoacoustic
calculators Psycho1A_L (42A), Psycho1A_R (52A), Psycho1B_L
(42B), Psycho1B_R (52B), Psycho1C_L (42C), Psycho1C_R (52C),
Psycho1D_L (42D) and Psycho1D_R (52D), respectively. The
subband filters divide the total audio spectrum into frequency
bands, possibly using FFT, and may calculate the maximum or
scale factor of the coefficients in a frequency band or
subband. Within the frequency bands a normalisation may be
carried out. The subband filters take into account the
relevant encoding parameters read from the corresponding
upstream asynchronous buffer.

The first psychoacoustic calculators perform an FFT having a
length of e.g. 1024 samples and determine the current masking
information. Each first psychoacoustic calculator can be
followed by a second psychoacoustic calculator Psycho2A_L
(62A), Psycho2A_R (72A), Psycho2B_L (62B), Psycho2B_R (72B),
Psycho2C_L (62C), Psycho2C_R (72C), Psycho2D_L (62D) and
Psycho2D_R (72D), respectively, which evaluates the maximum or
scale factor values previously calculated in the subband
filters. The first and second psychoacoustic calculators take
into account the relevant encoding parameters read from the
corresponding upstream asynchronous buffers.

The output signals of Psycho2A_L (62A), Psycho2A_R (72A),
Psycho2B_L (62B), Psycho2B_R (72B), Psycho2C_L (62C),
Psycho2C_R (72C), Psycho2D_L (62D) and Psycho2D_R (72D) are
used in bit allocators and quantisers Bal/Q/E_A (82A),
Bal/Q/E_B (82B), Bal/Q/E_C (82C) and Bal/Q/E_D (82D),
respectively, for determining the number of bits allocated and
the quantisation of the audio data coefficients coming from
the associated subband filter via a buffer. It is also
possible to calculate in the second psychoacoustic calculators
in addition what is being calculated in the first
psychoacoustic calculators and thereby to omit the first
psychoacoustic calculators.

Finally, the outputs of Bal/Q/E_A (82A), Bal/Q/E_B (82B),
Bal/Q/E_C (82C) and Bal/Q/E_D (82D) pass through asynchronous
buffers and output interfaces AES-EBU_A (92A), AES-EBU_B
(92B), AES-EBU_C (92C), AES-EBU_D (92D), respectively, which
deliver the encoder stereo output signals PCM_Out_A (96A),
PCM_Out_B (96B), PCM_Out_C (96C), PCM_Out_D (96D),
respectively.

FIG. 2 shows a data field (200) including audio samples or
audio coefficients COE (205) for a frame. To these samples or
coefficients encoding or decoding parameters PAR (210) are
linked or assigned. PAR (210) includes for instance mode
information (mono, stereo, dual, joint stereo), sample rate
and data rate information, length of the data field, type of
MPEG layer. An address pointer POI (215) indicates the
beginning of the parameter data PAR (210).

In FIG. 3 a microprocessor or DSP µP (305) is shown together
with its memory MEM (310). In the memory, data fields A to F
(302A to 302F, respectively) are depicted which correspond to
data fields as depicted in FIG. 2. E.g. data fields A (302A),
B (302B) and C (302C) may correspond to data fields of three
su[...] data paths of FIG. 1. Data field A (302A) may include
other encoding parameters PAR than data fields B (302B) and C
(302C). The address of the beginning of data field B is
calculated by adding the length of data field A (302A) to POI.

The software running on µP (305) can use the following example
commands in C-language for constructing the data fields
according to FIG. 2:
    typedef struct {
        int bitrate_index;
        int sampling_frequency;
    } layer;

"struct" may also contain time stamp information.

    #define FRAMESIZE 1152 /* 1152 is a decimal number */

    typedef struct {
        layer info;
        float PCMBuf[FRAMESIZE];
    } FloatBuffer_L_Type;

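
As an illustration of how a processing stage could use such linked data fields, the following minimal sketch passes complete data fields through a FIFO buffer, so that each stage reads the parameters belonging to the frame it is currently processing rather than a global setting. The names FrameFifo, FIFO_DEPTH, fifo_push, fifo_pop and stage_process are hypothetical and do not appear in the description; the two structures repeat the example definitions only so that the sketch is self-contained.

```c
#include <stddef.h>

#define FRAMESIZE 1152   /* samples per frame, as in the example listing */
#define FIFO_DEPTH 4     /* hypothetical number of frames a buffer can hold */

/* Encoding parameters that travel with each frame. */
typedef struct {
    int bitrate_index;
    int sampling_frequency;
} layer;

/* Linked data field: the parameters plus the samples of one frame. */
typedef struct {
    layer info;
    float PCMBuf[FRAMESIZE];
} FloatBuffer_L_Type;

/* Hypothetical FIFO between two stages; it stores whole linked data fields. */
typedef struct {
    FloatBuffer_L_Type slot[FIFO_DEPTH];
    size_t head;    /* index of the oldest stored frame */
    size_t count;   /* number of frames currently stored */
} FrameFifo;

/* Copies a frame, including its parameters, into the FIFO.
   Returns 1 on success and 0 if the FIFO is full. */
int fifo_push(FrameFifo *f, const FloatBuffer_L_Type *frame)
{
    if (f->count == FIFO_DEPTH)
        return 0;
    f->slot[(f->head + f->count) % FIFO_DEPTH] = *frame;
    f->count++;
    return 1;
}

/* Removes the oldest frame from the FIFO.
   Returns 1 on success and 0 if the FIFO is empty. */
int fifo_pop(FrameFifo *f, FloatBuffer_L_Type *out)
{
    if (f->count == 0)
        return 0;
    *out = f->slot[f->head];
    f->head = (f->head + 1) % FIFO_DEPTH;
    f->count--;
    return 1;
}

/* Hypothetical stage: it consults only the parameters linked to the
   frame it was handed and returns the bitrate index it would apply. */
int stage_process(const FloatBuffer_L_Type *frame)
{
    return frame->info.bitrate_index;
}
```

Because every frame carries its own copy of the parameters, a frame queued before a parameter change is still processed with its original values, and no reset of the stage is needed.
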
The invention can be used e.g. for MPEG 1, 2 and 4 Audio
encoding and decoding for MPEG layers 1, 2 or 3, Digital 
Video Broadcast DVB, for AC-3, MD and AAC processing, 
for DVD processing and Internet applications concerning 
audio or video data encoding and decoding. 
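
The address calculation described for FIG. 3 can be sketched as follows, assuming for illustration only that each data field starts with its parameter block PAR and that PAR stores the total length of the field in bytes. ParHeader, write_field, next_field and field_sample_rate are hypothetical names, and the actual ordering of COE and PAR within a data field is not specified in the description.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical parameter block PAR; the member names are assumptions. */
typedef struct {
    size_t field_length;  /* total length of this data field in bytes */
    int    mode;          /* e.g. mono, stereo, dual, joint stereo */
    int    sample_rate;
} ParHeader;

/* Writes one data field (PAR followed by payload bytes) at dst and
   returns the number of bytes written. */
size_t write_field(unsigned char *dst, int mode, int sample_rate,
                   const unsigned char *payload, size_t payload_len)
{
    ParHeader par;
    par.field_length = sizeof par + payload_len;
    par.mode = mode;
    par.sample_rate = sample_rate;
    memcpy(dst, &par, sizeof par);
    if (payload_len > 0)
        memcpy(dst + sizeof par, payload, payload_len);
    return par.field_length;
}

/* Returns the start address of the data field following the one at
   `field`, by adding the stored field length to the field's address. */
const unsigned char *next_field(const unsigned char *field)
{
    ParHeader par;
    memcpy(&par, field, sizeof par);  /* copy avoids alignment assumptions */
    return field + par.field_length;
}

/* Reads the sample rate linked to the data field at `field`. */
int field_sample_rate(const unsigned char *field)
{
    ParHeader par;
    memcpy(&par, field, sizeof par);
    return par.sample_rate;
}
```

Storing the length inside each field lets consecutive fields have different sizes and different parameters while still being traversable without a separate table.
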

What is claimed is: 

1. Method for encoding at least one of audio and video 
frame data for which encoding parameters are required, 
including the following steps: 

linking the required encoding parameters at the input of 
the processing with frames of said at least one of audio 
and video data to be encoded; 

keeping the required encoding parameters linked through- 
out different subsequent stages in the encoding 
processing, wherein in each of these stages the
corresponding encoding parameters linked with current
frame data to be processed are regarded in order to 
allow switching of the encoding parameters for any 
frame thereby avoiding encoding of invalid output data 
without reset. 

2. Method according to claim 1, wherein the required 
encoding parameters become converted to a different format 
before being linked to the frames of said at least one of audio 
and video data to be encoded or decoded. 

3. Method according to claim 1, wherein to each stage an 
asynchronous buffer is assigned to an input of the stage and 
wherein the asynchronous buffer contains data fields includ- 
ing said at least one of audio and video frame data and the 
required encoding parameters. 

4. Method according to claim 2, wherein to each stage an 
asynchronous buffer is assigned to an input of the stage and 
wherein the asynchronous buffer contains data fields includ- 
ing said at least one of audio and video frame data and the 
required encoding parameters. 

5. Method for decoding at least one of audio and video 
frame data for which decoding parameters are required,
including the following steps: 

linking the required decoding parameters at the input of 
the processing with frames of said at least one of audio 
and video data to be decoded; 
keeping the required decoding parameters linked
throughout different subsequent stages in the decoding
processing, wherein in each of these stages the
corresponding decoding parameters linked with current
frame data to be processed are regarded in order to 
allow switching of the decoding parameters for any 
frame thereby avoiding decoding of invalid output data 
without reset. 

6. Method according to claim 5, wherein the required 
decoding parameters become converted to a different format 
before being linked to the frames of said at least one of audio 
and video data to be encoded or decoded. 

7. Method according to claim 5, wherein to each stage an 
asynchronous buffer is assigned to an input of the stage and 
wherein the asynchronous buffer contains data fields includ- 
ing said at least one of audio and video frame data and the 
required decoding parameters. 

8. Method according to claim 6, wherein to each stage an 
asynchronous buffer is assigned to an input of the stage and 
wherein the asynchronous buffer contains data fields includ- 
ing said at least one of audio and video frame data and the 
required decoding parameters. 

9. Apparatus for encoding at least one of audio and video 
frame data for which encoding parameters are required, and 
including: 

means for linking the required encoding parameters with 
frames of said at least one of audio and video data, said 
linking means being arranged near the input of the 
apparatus; 

means for converting time domain samples into frequency 
domain coefficients, to the input of which means buffer 
means are assigned; 

means for calculating masking properties from said time 
domain samples, to the input of which means buffer 
means are assigned; 

means for performing bit allocation and quantisation of 
the coefficients under the control of the output of said 
masking calculating means, to the input of which bit 
allocation and quantisation means buffer means are 
assigned, 

wherein in said conversion means, in said masking calcu- 
lating means and in said bit allocation and quantisation 
means the corresponding encoding parameters linked with 
current frame data to be processed are regarded in order to 
allow switching of the encoding parameters for any frame 
thereby avoiding encoding of invalid output data without 
reset. 

10. Apparatus according to claim 9, wherein the required 
encoding parameters become converted to a different format 
before being linked to the frames of said audio or video data 
to be encoded or decoded. 

11. Apparatus according to claim 9, wherein said buffer 
means contain data fields including said at least one of audio 
and video frame data and the required encoding parameters. 

12. Apparatus according to claim 10, wherein said buffer 
means contain data fields including at least one of audio and 
video frame data and the required encoding parameters. 